Frame-level Nonlinearity for Robust DTW-based Speaker Verification
Authors
Abstract
Dynamic time warping (DTW) is a successful algorithm in many matching and searching tasks. For text-dependent speaker verification, it remains an appropriate choice when enrollment data are very limited. However, DTW is very sensitive to endpoint variations between the reference template and the test examples. Most reported research on this issue follows one of two directions: robust endpoint detection and relaxation of the endpoint constraints. In this paper, we propose a third possible solution that employs a frame-level nonlinear transform. The parameter of the transform function may be universal, template-dependent, or frame-dependent. The method also normalizes the DTW matching distance at the same time. Results indicate that text-dependent speaker verification performance improves markedly in both clean and noisy environments, with relative reductions in equal error rate (EER) of 20.6% and 35.0%, respectively. We expect that the proposed method may be effective in other DTW applications as well.
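To make the idea concrete, here is a minimal sketch of DTW with an optional per-frame transform applied to the local distance before accumulation. The abstract does not state which transform function the paper uses, so `saturating_transform` and its parameter `beta` are purely illustrative assumptions, as are the function names and the Euclidean local distance. The sketch only shows why bounding each frame's cost can limit the damage from mismatched endpoint frames.

```python
import math


def frame_distance(a, b):
    """Euclidean distance between two feature vectors (assumed local distance)."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def saturating_transform(d, beta=1.0):
    """Hypothetical nonlinear transform, NOT the paper's function.

    Maps a distance in [0, inf) to [0, 1), so one badly mismatched
    frame can add at most 1 to the accumulated cost. `beta` could be
    shared, chosen per template, or chosen per frame.
    """
    return d / (d + beta)


def dtw_distance(template, test, transform=None):
    """Classic DTW with fixed endpoints and the three standard moves."""
    n, m = len(template), len(test)
    inf = float("inf")
    # cost[i][j] = best accumulated cost aligning the first i and j frames
    cost = [[inf] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            d = frame_distance(template[i - 1], test[j - 1])
            if transform is not None:
                d = transform(d)
            cost[i][j] = d + min(
                cost[i - 1][j],      # template frame repeated
                cost[i][j - 1],      # test frame repeated
                cost[i - 1][j - 1],  # both advance
            )
    return cost[n][m]


if __name__ == "__main__":
    template = [[0.0], [1.0], [2.0]]
    # Same utterance with one spurious trailing frame (a bad endpoint).
    noisy = [[0.0], [1.0], [2.0], [50.0]]
    print(dtw_distance(template, noisy))
    print(dtw_distance(template, noisy, transform=saturating_transform))
```

In this toy case the untransformed distance is dominated by the single spurious trailing frame, while the transformed distance stays below 1. Because every transformed frame cost lies in [0, 1), the accumulated score is also bounded by the path length, which is one plausible reading of how such a transform could normalize the matching distance.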
Similar resources
Using Exciting and Spectral Envelope Information and Matrix Quantization for Improvement of the Speaker Verification Systems
Speaker verification from a few spoken words or sentences has many applications. Many methods, such as DTW, HMM, VQ, and MQ, can be used for speaker verification. We applied MQ for its precise, reliable, and robust performance and its computational simplicity. We also used pitch frequency and the log-gain contour to further improve system performance.
Comparing DTW-Based and HMM-Based Text-Dependent Speaker Verification Algorithms
Speaker verification is among the widely used biometrics which usually offer more secure authentication for user access than regular passwords. In this final project, we study the DTW-based and HMM-based speaker verification algorithms and a comparison between them is made based on their performances on our recorded dataset. The two feature sets commonly used in Speech Recognition Systems, LPC ...
A robust speaker verification system against imposture using an HMM-based speech synthesis system
This paper describes a text-prompted speaker verification system which is robust to imposture using synthetic speech generated by an HMM-based speech synthesis system. In the verification system, text and speaker are verified separately. Text verification is based on phoneme recognition using HMM, and speaker verification is based on GMM. To discriminate synthetic speech from natural speech, an...
Comparison of VQ and DTW Classifiers for Speaker Verification
An investigation into the relative speaker verification performance of various types of vector quantisation (VQ) and dynamic time warping (DTW) classifiers is presented. The study covers a number of algorithmic issues involved in the above classifiers, and examines the effects of these on the verification accuracy. The experiments are based on the use of a subset from the Brent (telephone quali...